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ASIC CLOCK FLOOR PLANNING 



METHOD AND STRUCTURE^ 1Q/ 539334 



Technical Field 



The field of the invention is that of designing and constructing a clock tree in 
an integrated circuit 

Background Art 

In the field of integrated circuits, a constant need is that of distributing the 
clock signal. As clock speeds have increased, the tolerances on clock skew 
have tightened. 

A number of commercial products are on the market to assist in the design 
process. Cadence offers a program, Clock Tree Synthesis (TM), which 
generates a tree with limits on the size of a buffer and/or on the complexity of 
the floor plan that the tree can accommodate. Synopsis offers a program, 
Clock Tree Compiler (TM) that is generally similar. 

IBM has a program, using a method described in U.S. Patent 6204713, which 
can handle large buffers, which permits a tree with fewer levels. 

The problem in generating clock trees can be described generally as striking 
a balance between delay, power consumed and ability to accommodate 
circuit blocks that have had their designs frozen. A short delay in the clock 
chain helps control skew. Low power is always desirable and the ability to re- 
use predesigned circuit blocks (embedded processors, RAMs, I/O, etc.) 
without redoing the layout to accommodate the clock saves greatly on cost in 
developing an ASIC. 
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Disclosure of the Invention 

The invention relates to a method of designing a clock tree that 
accommodates an existing layout while selecting clock buffer size and 
orientation to match the floorplan. 

Accordingly, a method and corresponding article of manufacture are provided 
for designing a clock tree In an integrated circuit comprising: 

collecting a set of sink locations 3 in a master list and a set of blocked areas 
47; 

(a) selecting a temporary insertion point (TIP) 4, 5; 

(b) enclosing the sink 41 at the first level furthest from the TIP in a bin 40 that 
includes a first subset of sinks 43 and remove the first subset.from the master 
list; 

(c) assigning a first-level structured clock buffer (SCB) 42 to the bin; 

repeating steps (a), (b) and (c) above for the remaining sinks in the first level 
of buffers and subsequent levels until the root level is reached; 

improving the symmetry of the tree by moving SCB 45 locations within 
constraints 46 to concentrate SCBs in rows and columns; 

connecting the root level TIP 610 to lower levels; and 

connecting a source (S) of clock signals to the root level TIP. 

Brief Description of The Drawings 

Figure 1 lists steps in a preferred embodiment of the invention. 
Figure 2 shows a sample set of sink locations. 
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Figure 3 shows an example of grouping the sinks in bins served by a buffer. 
Figures 4 and 5 show initial and final locations of a set of buffers. 
Figure 6 shows a completed tree. 

Best Mode for Carrying Out the Invention 

The invention is intended to be used in the design of integrated circuits,* 
especially ASICs, where a short design process is an important part of 
keeping costs down. In the course of designing large-scale (30,000,000+ 
gates) ASICs, the process of routing the clock tree around the pre-designed 
modules that form the bulk of the ASIC consumes a significant amount of 
time and cost. ' 1 

Among advantages that may arise by application of the invention are the 
following: 

-automatic or manual selection of a reference point (or temporary 
insertion point (TIP)) that starts the branching process; 

-flexibility in range and aspect ratio of buffers; 

-choice of either one or both of top-down or bottom-up methods; 

-provision of trees having buffers of different sizes; 

-allowance for the user to intervene in the design; 



The sequence of steps according to a preferred embodiment of the invention 
is: 

collect the clock sink locations and blockage dimensions. 
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pick the TIP for the lowest (leaf ) level 

group the sinks in bins, starting with the one furthest from the TIP and place a 
first-level buffer in each bin 

pick a TIP for the first level 

group the first-level buffers in bins, starting with the one furthest from the TIP 
and place a second-level buffer in each bin 

repeat the TIP selection and binning step until the root level is reached 

improve the symmetry by moving buffers within constraints to concentrate 
buffers in rows and columns 

add buffers from the source to the TIP, and 

insert the tree in the netlist. 

Figure 2 illustrates a simplified example of an integrated circuit being 
processed according to the invention. At the center of the figure, filled 
rectangle i represents the TIP. Hollow rectangles 3 represent clock sinks, 
which is a general term for the interface between the tree and the logic. The 
logic surrounding the interface points is omitted for clarity in presentation. 
The diamond on the left side represents the source of the clock signals. 
Capital C, 4, represents a point calculated by taking the mean between the 
extreme coordinates of all the sinks - e.g. x = (xmax + xmin)/2 and y = (ymax 
+ ymin)/2. The capital C\ 5, represents the centroid calculated by taking the 
mean of all the sink coordinates. These sinks will be grouped and assigned 
to a buffer that distributes the clock signal. 

Buffers according to the invention, referred to as structured clock buffers 
(SCBs) are circuits built of N parallel kernels along an axis, where N is greater 
than or equal to one. These buffers can be inverting or Non-Inverting. Outputs 
of the kernel are tied together. Balanced wiring is used to wire the circuits of 
the Structured Clock Buffers. U.S. Patent 6204713, incorporated by 
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reference, explains the design and construction of such buffers. 

It is an advantageous feature of the invention that the buffers are not 
restricted to a layout along a single axis (horizontal or vertical) but that layout 
along both axes may be provided, in order to permit a buffer to fit in a limited 
5 space. As those skilled in the art are aware, circuit layout is conventionally 

done with the axis of the transistors lined with one of the coordinate 
directions. 

Designing a vertical buffer that fits within a layout that was intended to be 
10 horizontal is non-trivial and requires that the transistors in the vertical SCBs 

must be in same orientation as the horizontal SCBs. In addition, balanced 
wiring is needed for both orientations. Power rings are needed for either the 
vertical or horizontal SCB to account for differences in the vertical or 
horizontal power distribution. This may require an additional wiring level for 
15 vertical or horizontal SCBs in order to fit in additional wires. Layouts are 

modified so that each horizontal SCB has a delay and output drive capability 
that match a corresponding vertical SCB. 

The TIP can be calculated automatically by calculating the delay, power and 
placability at the center of the sinks (C), the centroid of the sinks (C) and at a 

20 set of intermediate points between C and C\ The selection rule is 

summarized as: If the center of the sinks is closer to the source than the 
centroid is, then the TIP is placed at the center. If there is a block (a module 
that cannot be re-drawn) at the center, then the TIP is placed at a different 
location, either the centroid or at one of the intermediate points between the 

25 center and the centroid. 

The placability algorithm produces a result of zero if the location is blocked. 
The best score is generated by an unblocked location large enough for the 
SCB, and closest to the desired placement point. The score decreases as 
the distance from the desired point increases, reflecting the undesirability of 
30 additional distance. 

The TIP point can be determined by calculating a score that is the sum S1 W1 
+ S2W2 + S3W3, where S1 , S2 and S3 are 0 or 1 , depending on which 
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aspect the designer wishes to emphasize and W1 = K1*Delay, W2 = 
K2*Power and W3 = K3*Placability, where Ki is determined empirically 

Once the TIP has been placed, the sinks at the lowest level of the clock tree 
are grouped. A rectangle (bin) is placed, just enclosing the sink farthest from 
5 the TIP. The initial size of the bin is set empirically, based on experience with 

the density of sinks. The bin may be adjusted up or down to approach the 
desired size. Illustratively, a bin holding about 20 The actual number is 
determined by the drive capability of the SCB and the wiring delays to get to 
the sinks. 

10 Similarly, the number of levels in the tree will be set empirically, e.g. at three. 

Again the actual number is determined by the drive capability of the SCB and 
the wiring delay's to get to the sinks. More levels can be added for larger 
trees. 

A buffer is placed at the center of the rectangle, which may be moved slightly 
15 in the x and/or y directions to accommodate the maximum number of sinks. 

The size of the buffer is set to be able to drive the sinks in the bin. It is an 
advantageous feature of the invention that the size of the buffers may be 
selected from a set of pre-designed buffers in order to drive the number of 
sinks that happen to be in the bin, rather than cutting down the size of the bin 
20 to accommodate the buffer. The sinks included in that bin are deleted from 

the master list of sinks and the next bin is placed. 

Fig 3 has been drawn to illustrate a situation in which a pair of modules 45 
and 47 are located such that a horizontal buffer will not fit. According to the 
invention, the buffer is moved toward the TIP to try to find a location that fits 

25 and still permits the buffer to drive the sinks. If such a location is not found, a 

vertical buffer is placed. If a vertical buffer will not fit, the program places a 
horizontal buffer in the original location, even if it is overlapping fixed blocks, 
printing a warning message for the user to correct it manually. Such a 
situation is illustrated with buffer 42. The sequence is that the program will 

30 exhaust the possibilities in the horizontal, and then exhaust the possibilities in 

the vertical then warn the user if no placement is possible. 
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Referring now to Fig 3, there is shown a set of three SCB areas represented 
by boxes 20, 30 and 40 holding sink subsets 23, 33 and 43 and positioned 
around SCBs 22, 32 and 42, respectively. The dimensions of a box represent 
the distance over which a clock signal can be transmitted by the SCB at the 
5 center of the box within the restrictions on skew, etc. set by the circuit 

designer. Illustratively, the boxes have been shifted to have a boundary that 
is next to the extreme position of a sink - i.e. box 20 has been placed so that 
the sink 21 , furthest from the TIP is at its edge; similarly for box 30 and sink 
31 and for box 40 and sink 41 . 

10 According to the invention, the size of the SCB will be set to drive the sinks 

within its range. It is an advantageous feature of the invention that the size of 
a buffer is flexible. 

Once the first bin is set, the sink remaining in the master list that is furthest 
from the TIP is located and a bin is placed enclosing it. The preceding 
15 procedure is repeated until all the sinks have been placed in bins. This ' 

covers the leaf level of the clock tree. 

The binning procedure is repeated, using first and second level buffers in 
place of leaf-level sinks, until the root level of the tree is reached. 

Once the sinks have been allocated to SCBs, a process of improving the 
20 symmetry of the tree can be performed. Fig. 4 shows a simplified example of 

a layout in which three SCBs 142, have been fixed for one reason or another 
and will not be moved in this step. Examples of a fixed SCB may be a 
customer pre-placing and assigning one SCB to some high speed critical 
circuits (i.e. high speed interface). Another case might be one where part of 
25 the chip already meets the timing requirements and the customer does not 

wish to revisit the compromises made to get that result. 

Blocks 47 represent layout modules that will not be moved. The clock tree 
must be routed around them. The other SCBs, labeled 45, can be moved. In 
this example, there is a 10x10 grid and a rule that an SCB can be moved by 
30 only one step vertically or horizontally. Possible moves are indicated by the 

arrows. 
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The principle of the move is to concentrate SCBs on the same row or column, 
thereby providing a more compact distribution net and simplifying skew and 
latency. 

The procedure followed is to calculate, for each row and column, the number 
of SCBs in that row or column, and the number that could be included within 
the rules. The numbers on the side and bottom indicate the actual number 
and potential number of SCBs in that row or column. 

Fig. 5 shows the final configuration of SCBs after allowed moves have been 
made. The effect of the moves has been to concentrate the horizontal 
distribution in four rows, with the vertical distribution being spread more 
broadly. 

Fig. 6 shows the result of connecting the SCBs from Fig. 5. Leaf-level buffers 
are denoted with numeral 630. Second-level buffers are denoted with 
numeral 620 (located in the middle, about one third from the bottom and from 
the top). The root level buffer is denoted with numeral 610, located at the 
center. Location of the root at or near the center is not required, though it 
contributes to symmetry and a compact layout. The location of the source is 
not shown in this Figure. It will be located at a fixed position and a route 
constructed from it to SCB 610. 

The temporary insertion point, which is used in each level of the clock tree 
looks at both its sink circuits (bottom of tree) and the source circuit for those 
sinks (top of tree). 

While the invention has been described in terms of a single preferred 
embodiment, those skilled in the art will recognize that the invention can be 
practiced in various versions within the spirit and scope of the following 
claims. 

Industrial Applicability 

The invention has applicability to the design and fabrication of integrated 
electronic circuits. 
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